A Biological Inspired Robotic Auditory System Based on Binaural Perception and Motor Theory

Authors

  • Enzo Mumolo
  • Massimiliano Nolich
Abstract

In this paper we present a novel artificial auditory system for humanoid robots. We address the problem of estimating an articulatory representation of the speech of the talker who is speaking to the robot using our auditory system. According to the motor theory of perception, the articulatory representation is the first step of a robust speech understanding process. The system is composed of two parts, namely a beam-forming module and a perception module. The beam-former is two-channel (i.e. dual-microphone) and is based on the super-directive beam-forming algorithm. The environment is scanned in search of a sound source; when the direction of the source is found, the reception lobe of the dual-microphone system is steered to that direction and the signal is acquired. The perception module is based on a fuzzy computational model of human vocalization. In summary, the relationships between places of articulation and speech acoustic parameters are represented with fuzzy rules. Starting from the articulatory features, a set of acoustic parameters is generated according to the fuzzy rules. These acoustic parameters are used to generate a synthetic utterance, which is compared in the perceptual domain to the corresponding spoken utterance. The goal is to estimate the membership degrees of the articulatory features using analysis-by-synthesis and genetic optimization.

Introduction

It is well known that, through the auditory system, living creatures gather important information about the world in which they live. For lower animals, this may mean being able to escape from danger or to catch prey; for humans, it may mean being able to focus one's attention on events such as a phone ringing or a person talking. Robots also benefit greatly from auditory capabilities, because their intelligence can be improved by fusing auditory information with the information coming from other sensors, such as vision.
The aim of this paper is to propose an artificial auditory system that gives a robot the ability to locate sound sources using binaural perception, and to perceive speech, in terms of an articulatory representation, on the basis of the motor theory of perception (Liberman & Mattingly 1985). Fig. 1 shows the block diagram of our auditory system. In summary, this paper focuses on the following two auditory capabilities:

  • binaural localization of human talkers
  • speech perception by articulatory features estimation

Copyright © 2006, American Association for Artificial Intelligence (www.aaai.org). All rights reserved.
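
The scan-and-steer idea from the abstract (search the environment for the direction of a sound source using two microphones) can be illustrated with a minimal sketch. It uses a plain delay-and-sum scan, which is a simpler stand-in and not the super-directive algorithm the paper is based on. The microphone spacing, sample rate, test tones and all function names are assumptions made for this illustration only.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value


def source_signal(t):
    """Hypothetical test source: a sum of five tones (not real speech)."""
    return sum(math.sin(2 * math.pi * f * t) for f in (300, 700, 1100, 1900, 2300))


def simulate_two_mics(angle_deg, mic_distance, sample_rate, n_samples):
    """Far-field source at angle_deg (0 = straight ahead, positive = toward the left mic).

    The right microphone receives the same signal delayed by d*sin(angle)/c.
    """
    delay = mic_distance * math.sin(math.radians(angle_deg)) / SPEED_OF_SOUND
    left = [source_signal(n / sample_rate) for n in range(n_samples)]
    right = [source_signal(n / sample_rate - delay) for n in range(n_samples)]
    return left, right


def steered_power(left, right, lag):
    """Mean power of left[n] + right[n + lag] over the overlapping samples."""
    start = max(0, -lag)
    stop = min(len(left), len(right) - lag)
    total = sum((left[n] + right[n + lag]) ** 2 for n in range(start, stop))
    return total / (stop - start)


def estimate_direction(left, right, mic_distance, sample_rate, step_deg=5):
    """Scan candidate angles and return the one with the highest steered power."""
    best_angle, best_power = None, -1.0
    for angle in range(-90, 91, step_deg):
        delay = mic_distance * math.sin(math.radians(angle)) / SPEED_OF_SOUND
        lag = round(delay * sample_rate)  # nearest whole-sample delay
        power = steered_power(left, right, lag)
        if power > best_power:
            best_angle, best_power = angle, power
    return best_angle


left, right = simulate_two_mics(30, mic_distance=0.2, sample_rate=48000, n_samples=4800)
print(estimate_direction(left, right, mic_distance=0.2, sample_rate=48000))
```

Because candidate delays are rounded to whole samples, angular resolution in this sketch degrades toward the endfire directions (near ±90 degrees), where several candidate angles map to the same lag. A practical system would need fractional delays or a frequency-domain beam-former.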

Similar resources

Bayesian real-time perception algorithms on GPU: Real-time implementation of Bayesian models for multimodal perception using CUDA

In this text we present the real-time implementation of a Bayesian framework for robotic multisensory perception on a graphics processing unit (GPU) using the Compute Unified Device Architecture (CUDA). As an additional objective, we intend to show the benefits of parallel computing for similar problems (i.e. probabilistic grid-based frameworks), and the user-friendly nature of CUDA as a progra...

Transfer from action to perception: The effect of motor-perceptual enrichment

This study investigated the effect of audiovisual integration on action-perception transfer. Forty subjects were randomly divided into four groups: visual, visual-auditory, control visual and control visual-auditory. The visual groups watched the movement pattern of a skilled basketball player; the other groups, in addition to watching the pattern, heard the elbow angular velocity as a sonification. In first st...

A Bayesian Binaural System for 3D Sound-Source Localisation

In this text we present a Bayesian system of auditory localisation in distance, azimuth and elevation using binaural cues only. We describe its supporting sensor model and calibration procedure. The binaural system is also integrated in a spatial representation framework for multimodal perception of 3D structure and motion — the Bayesian Volumetric Map (BVM). This solution will enable the imple...

Sensory Behavior of Naval Personnel: Monaural/Binaural Minimum Audible Angle of Auditory Response

This paper considers what one ear contributes to man's perception of his auditory world and evaluates the monaural/binaural role in spatial orientation. Minimum audible angles were determined for monaural listening to moving sounds, and results were compared to similarly obtained binaural data. Much usable directionality existed for the monaural mode even at poor azimuths, and for both modes of...

Effect of Using Image-Schemas on Learning L2 Prepositions and Enhancing Learner Autonomy: A Dynamic System Theory and Cognitive Linguistics-Inspired Approach

This study investigated the effect of applying the dynamic system theory (DST) and cognitive linguistics (CL) insights into grammar instruction on EFL learners’ learning of English prepositions and learner autonomy. Sixty Iranian EFL learners at the lower-intermediate level of language proficiency were randomly assigned to 1 experimental and 1 control group. The 2 groups filled out an autonomy ...

Publication date: 2006